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Abstract 

We consider N x N symmetric random matrices where the probability distribution for each matrix 
element is given by a measure v with a subexponential decay. We prove that the eigenvalue spacing 
statistics in the bulk of the spectrum for these matrices and for GOE are the same in the limit N — > oo. 
Our approach is based on the study of the Dyson Brownian motion via a related new dynamics, the local 
relaxation flow. 
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1 Introduction 

A central question concerning random matrices is the universality conjecture which states that local statistics 
of eigenvalues are determined by the symmetries of the ensembles but are otherwise independent of the details 
of the distributions. There are two types of universalities: the edge universality and the bulk universality 
concerning the interior of the spectrum. The edge universality is commonly approached via the moment 
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method [20, 21] while the bulk universality was proven for very general classes of unitary invariant ensembles 
(see, e.g. [3, 5, 6, 18, 19] and references therein) based on detailed analysis of orthogonal polynomials. For 
non-unitary ensembles, recent results by Tao-Vu and the current authors [11, 22, 12] have established the 
bulk universality for Hermitian Wigner ensembles. The approaches of [11] and [22] are based on the following 
two common ingredients: (1) Johansson's [16] result (see also [2]) stating that bulk universality holds for 
ensembles of the form 

H + sV, (1.1) 

where H is a Wigner matrix, V is an independent standard GUE matrix and s is a positive constant. (2) 
For a random matrix H , find a random matrix H so that the eigenvalue statistics between H and H + sV 
are near each other. 

Johansson's proof relied on the asymptotic analysis of an explicit formula by Brczin-Hikami [4, 16] for 
the correlation functions of the eigenvalues of H + sV . Unfortunately, the similar formula for GOE is not 
very explicit and the corresponding result is not available. On the other hand, if we vary the parameter 
s in the matrix H + sV and s 2 is interpreted as time, then the evolution of the eigenvalues is the Dyson 
Brownian motion (DBM) [7]. If we replace the Brownian motions by the Ornstein-Uhlenbeck processes, the 
resulting dynamics on the eigenvalues, which we still call DBM, has the GUE eigenvalue distribution as the 
invariant measure. Thus the result of Johansson can be interpreted as stating that the local statistics of 
GUE is reached via DBM for time of order one. In fact, by analyzing the dynamics of DBM with ideas from 
the hydrodynamical limit, we have extended Johansson's result to s 2 ^> iV~ 3 / 4 [10]. The key observation 
of [10] is that the local statistics of eigenvalues depend exclusively on the approach to local equilibrium. 
This approach avoids the usage of explicit formulae for correlation functions, but the identification of local 
equilibria, unfortunately, still uses explicit representations of correlation functions by orthogonal polynomials 
(following e.g. [19]), and the extension to other ensembles is not a simple task. 

Therefore, the universality for symmetric random matrices remained open and the only partial result is 
given by Tao-Vu (Theorem 23 in [22]) for Wigner matrices with the first four moments of the matrix elements 
matching those of GOE. In this paper, we introduce a dynamical approach based on a new flow, the local 
relaxation flow, which locally behaves like DBM, but has a faster decay to equilibrium. In this approach the 
analysis of orthogonal polynomials or explicit formulae are completely eliminated and the method applies to 
both Hermitian and symmetric ensembles. In fact, the heart of the proof is a convex analysis and it applies 
to /3-cnscmbles for any /3 > 1. The model specific information required to complete this approach involves 
only rough estimates on the accuracy of the local density of states. We expect this method will apply to a 
very general class of models. 

2 Statement of Main Results 

To fix the notation, we will present the case of symmetric Wigner matrices; the modification to the Hermitian 
case is straightforward and will be omitted. Fix N 6 N and we consider a symmetric matrix ensemble of 
N x N matrices H = (hgk) with normalization given by 

h tk = N-^xtk, (2.1) 

where xg^ for £ < k are independent, identically distributed random variables with the distribution v that 
has zero expectation and variance 1. The diagonal elements xu are also i.i.d. with distribution v that has 
zero expectation and variance two. 
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The joint distributions of the eigenvalues x = (x\, X2, ■ ■ ■ , Xn) of the Gaussian Unitary Ensemble (GUE) 
and the Gaussian Orthogonal Ensemble (GOE) are given by the following measure 



e -H(x) 
H = A*w(dx) = — dx, 

z 



W(x) = N 



N 



pJ2 



4 



N 



(2.2) 



where /3 = 1 for GOE and j3 = 2 for GUE. We consider /1 defined on the set of ordered eigenvalues 

S N := {x £ K w : x 1 < x 2 < . ■ ■ < x N }. 



For definiteness, consider the (3 = 1 GOE case. Suppose the matrix elements evolve according to the 
Ornstein-Uhlenbeck (OU) process on R, i.e. the density of their distribution v t = u t (x)dx satisfies 



1 d 2 



d 



dtUt =J?U t , J? =-—--- 



(2.3) 



with the reversible measure e~ x l 2 Ax (strictly speaking, a differently normalized OU process is used for the 
diagonal elements but we omit this detail here). The Ornstein-Uhlenbeck process (2.3) induces a stochastic 
process, the Dyson Brownian motion, on the eigenvalues with a generator given by 



JV N 



4 Xi 



9 AT 



2N '■ — ' Xi — Xj 



acting on L 2 (^i). Let 



r N 1 r 

D(f) = / fLfdfi = £ — / (d 3 f) 2 dv 



(2.4) 



(2.5) 



be the corresponding Dirichlet form, where dj = d Xj . Clearly (i is a reversible measure for the dynamics 
generated by L. Denote the distribution of the eigenvalues at time t by / t (x)/i(dx). Then f t satisfies 



dtft = Lf t . 



(2.6) 



The corresponding stochastic differential equation for the eigenvalues x(i) is now given by (see, e.g. Section 
12.1 of [141) 



dx,; 



2 AT 



1 



2N i — ' Xi — Xj 



dt. 



l<i<N, 



(2.7) 



where {Bi : 1 < i < N} is a collection of independent Brownian motions. The wcll-posedncss of DBM has 
been proved in Section 4.3.1 of [14], see the Appendix for more details. 

Note that the equations (2.6) and (2.7) are defined for any ft > 1, independently of the original matrix 
models. Our main Theorem 2.1 will be stated for general /3 > 1. 

Convention. We will use the letters C and c to denote general constants whose precise values are irrelevant 
and they may change from line to line. 
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2.1 Local Relaxation Flow 



We now introduce the local relaxation measure, which has the local statistics of GOE (or GUE) but has a 
faster decaying dynamics. Recall that 



1 

2^ 



V(4 



g sc (x)dx, 



(2. 



Qsc{x) := 

is the density of the semicircle law. Define 

n sc (E) :-- 
and let 

7i=n- 1 (i/iV). (2.9) 

Fix a positive number rj with TV -1 / 6 « i; « 1 and for the rest of this paper, e > is a small positive 
number which we will not specify. Let 7^ := jj ± ijN~ E and define the mean field potential of eigenvalues 
far away from the j-th one as 



Wj(x) 



L 

N 



E lo %(\ x ~ + ry) if x e Ij := {j 3 , 7/). 

k:\k~j\>N V 



(2.10) 



For x > 7^", we extend VF, by 



IF 



/ \ FF"(7 + ) / x 

(*) = ^(7+) + Wjilf) y° 7;) + — - 7;) 



(2.11) 



and similarly for a; < 77. In other words, VFj is just the simplest convex extension of the function defined by 
(2.10) on Ij. Fhis modification will avoid the singularities at x = 7^. Notice that this is purely a technical 
device since we will show in (4.17) of Proposition 4.4 that the regime Ij is negligible in the sense that 



N 

£ / f t \(x^IjW<Ce- cN ' . 
3=1 J 



(2.12) 



The local relaxation measure ujn = u is defined by 



uj = — exp 
Z 



Let 



JV 



j'=i 



E H logflari-Zjl +77) 



Z 



■ exp 



- H 



z 



N 



Y E togilxi-Xjl + ri-N^Wii? 

i=l j:\j-i\>Nr) i=l 



Recall the definition of the relative entropy with respect to a measure A 

Sx(f) = I /log/dA, S x (f\i>) = //log(//#lA 
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and the Dirichlet form 



Dx(h) = (Vh) 2 d\=^J2j ^ dA ■ ( 2 - 13 ) 



The local relaxation flow is defined to be the reversible dynamics w.r.t. u> characterized by the generator 
L defined by 

f fLgdu = ]T I djfdjgdLO (2.14) 

3 J 

Explicitly, L is given by 

L = Z + 5>fl„ 6, = i Y, Z { X\7l +W '^ (2J5) 

j ^ k:\k-j\>Nr,^ Xkl+V 

where 

W^) = ~ E f gn( ' T ' 7fc) (2-16) 

for x £ Ij. Note that for any fc with \k — j| > ./V77, we have 7^ ^ 2/j, where 21 j is the doubling of the interval 
Ij. Moreover, for k = j ± N-q we have |7fc — 7j| < Crj 2 ^ with some constant C, so |a; — 7fc| < C77 2 / 3 for 
x G Ij . Thus we obtain that 



inf W'ix) > c inf / 
x€lj \x\<2+vJ\ x 



0sc(l)d-/ 1/3 

I 1 9 — L '/ 

|2;-7|>Crp/3 |a;-7^ 



Because Wj was defined by a convex extension outside of Ij, the same bound holds for any x: 

inf inf W"{x) > crf lj3 , (2.17) 

i.e., the mean field potential is uniformly convex. 
We can now state the main result. 

Theorem 2.1 (Universality of Dyson Brownian Motion for Short Time) Suppose that S^/olVO < 
CN m for some m fixed. Let r = ?7 1//3 iV e with some e > and assume that r\ > iV _1 / 33+e . Assume that there 
is a positive number A smc/i i/ia£ 

supiV£ / tfftdn < Cn- 2 A. (2.18) 

t<r . J 

Fix n > 1, let G : l n — >■ M 6e a bounded smooth function with compact support and define 

£i,„(x) := G[N(xi - x i+ i),N(x i+ i - x i+2 ), ■ ■ .,N(x i+n -i - x i+n )\. (2-19) 

5ei t > R 2 N £ . Then for any J C {1, 2, . . . , N — n} we /laue 

/ ^££,n(x)/ T d,x- / l£>,„(x)d M | < CiV- 1 /^^ 2 ^ 5 /^ 1 / 2 . (2.20) 
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We emphasize that Theorem 2.1 applies to all (3 > 1 ensembles and the only assumption concerning the 
distribution f T is in (2.18). The potential W is chosen to satisfy the two convexity properties: (2.17) and 
(3.7). Potentially, there are many other choices for W. For example, without changing the form of W given 
in (2.10), a more natural choice for jj would be 




This would somewhat improve the estimate (2.22), but the analysis is more complicated and we will not 
pursue this choice in this paper. 

Returning to the original Wigner ensembles with /3 = 1, we can estimate the critical constant A in the 
following lemma, to be proved in Section 4. We assume that the measure dv satisfies the logarithmic Sobolev 
inequality (LSI), i.e. there is a constant 6 such that for any nonnegative function u with J u dv = 1 we have 

J ulogu dv < 9 J |VV"| 2 d^ . (2.21) 

We remark that (2.21) implies that v has a Gaussian decay. 

Lemma 2.2 Suppose the assumption (2.21) on the distribution v of the matrix elements hold. Then the 
constant A in (2.18) can be estimated as 

A < C a ?r 2 N^ 9+a (2.22) 

for any a > 0. 



Now we state the universality of the local eigenvalue statistics for Wigner matrices. To formulate the 
result in terms of correlation functions, it is convenient to remove the ordering x\ < X2 < ■ ■ ■ < xn among the 
eigenvalues and work with symmetric densities. Let pn{xi,X2, ■ ■ ■ ,xn) denote the (symmetric) probability 
density of eigenvalues and for any k = 1, 2, . . . , N let 

p ( $(xi,X2,...Xk) ■= / p N {xi,x 2 ,.-.,x N )dx k+ i...dx N (2.23) 

(k) 

be the fc-point correlation function. In the case of GOE, we denote the density functions by p N GOE and it 
is well known that for \u\ < 2 the limit 

(*0 (at a k \ 

A:GOE{u+ T f- 1 - v ---,u+——) (2.24) 



[ Qsc {u)] kIN > GOE \ Ng sc ( U y-' Nq sc (u) 
exists and can be explicitly computed (see, e.g. Section 7 of [18]) 



Theorem 2.3 Suppose the distribution v for the matrix elements satisfies the assumptions (2.21). Assume 
that the density v(x) = e~ u ^ satisfies 

m 

Y,\U U) {x)\<C(l + x 2 ) 1 (2.25) 
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for some fgN and m £ N sufficiently large. Let k > 1 and O : R k — > M 6e a continuous, compactly supported 
function. Then for any \u\ < 2, we have 



lim lim — / / 0(ai, . . . , a fc ) 
S-M) N-yco 2d J u _ s J Rk 

x T TTifc (^iv ) ~ P { ngoe) ( v + m —< \ ' •••>" + T7— Vt ) d «i • ■ • dafcdu = 0. 

[g S c{v)\ L V iv.^^yy Ng sc {v) Ng sc {v)J 



In the following Corollary we remove all assumption from Theorem 2.3 except for the decay condition 
and a technical condition that v is supported in at least three points. This latter technical condition was 
removed in our later paper [13], where we generalized our approach to a broader class of random matrix 
ensembles. 

Corollary 2.4 Suppose the distribution v of the matrix elements has mean zero, variance one and a tail 
with subexponential decay, i.e. it satisfies that 

J e |x|c d^(x) < oo (2.27) 

for some constants c > 0. Assume that v is supported in at least three points. Then the conclusion of 
Theorem 2.3 holds. 



Proof. Let rrij denote the moments of v 




where mi = and 7712 = 1. It is easy to check that > m§ + 1 with the equality holds only for Bernoulli 
type distribution supported in two points. Let v be the probability measure on M. with density 

e~ u M = (const.)e~ x2 / 2+F ^ 

where F is a smooth function with compact support. For the fixed numbers 013, m4 satisfying 7714 > m§ + 1, 
there exists an F such that the first four moments of v match to those of dv, i.e. 

/ x dp = 0, / x 2 dP = 1, / x- 1 dt> = rrij, j = 3, 4. 
Js. Js. Js. 

Here ||.F||oc and the support of F depend only on m.3,7714. Clearly, v satisfies the- assumption of Theorem 
2.3 and thus (2.26) holds for the measure v. Recall that Theorem 15 in [22] states that the local eigenvalue 
statistics for matrices whose matrix element distributions match up to the first four moments are the same 
in the limit N — ¥ 00 (strictly speaking, this theorem was stated only for hermitian matrices, but the parallel 
version for the symmetric matrix holds as well, see the remark at the end of Section 1.6 [22]). This proves 
the Corollary. rj 



Theorem 2.3 is a simple corollary of Theorem 2.1, Lemma 2.2 and the method of the reverse heat flow 
[11]. Lemma 2.2 is a technical result concerning some rough estimates of eigenvalues. Though we stated the 
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universality in terms of correlation functions, it also holds for the eigenvalue gap distribution and we omit 
the obvious statement (the analogous statement for the Hermitian case was formulated in Theorem 1.2 of 
[11])- 

We now sketch the key new ideas of this article. 



I. The key concept is the introduction of the local relaxation flow (2.14) which has the following two 
properties: (1) The invariant measure for this flow, the local relaxation measure to will be shown to 
have the same local eigenvalue statistics as the GOE. (2) The relaxation time of the local relaxation 
flow is much shorter than that of the DBM, which is of order one. 

II. Suppose we have a density q w.r.t. u). Then, by differentiating the Dirichlct form w.r.t. u under the 
local relaxation flow, we will prove that the difference between the local statistics of qui and to can be 
estimated in terms of the Dirichlet form of q w.r.t. ui. Hence if the Dirichlet form is small, the local 
statistics of qui is independent of q. 

III. It remains to show that the Dirichlet form of q = ftH w.r.t. tu is small for t sufficiently large (but still 
much less than order one). To do that, we study the evolution of the entropy of f t fi relative to u>. This 
provides estimates on the entropy and Dirichlct form which serve as inputs for the Step II to conclude 
the universality. 



3 Local Relaxation Flows and Entropy Dissipation Estimates 

The first ingredient to prove Theorem 2.1 is the analysis of the local relaxation flow which satisfies the 
logarithmic Sobolev inequality and the following entropy dissipation estimate. 

Theorem 3.1 (Dirichlet Form Dissipation Estimate) Suppose (2.17) holds. Consider the equation 

d t q t = Lq t (3.1) 

with reversible measure uj. Then we have 

dtDUV^Tt) < -C^DUVQi) " 2^2 / E (^hv (dtVft-diV*) 2 ^ (3-2) 

\i-j\<N V {Xl 3 > 

1 f°° f 1 

oat* / ds E (x . _ x .)2 (diVto-d jy /& 2 du < DUV^) (3-3) 

\i-j\<Nri 1 1 3> 
and the logarithmic Sobolev inequality 

SUq)<Cr, 1/3 DUVv) (3-4) 
with a universal constant C . Thus the time to equilibrium is of order 77 1 / 3 and we have 

SM<e- ct ^ 1/3 SUqo)- (3.5) 



8 



The proof given below follows the argument in [1] and it was outlined in this context in [10]. The 
new observation is the additional second term on the r.h.s of (3.2), corresponding to "local Dirichlet form 
dissipation". The estimate (3.3) on this additional term will play a key role in this paper. 

Proof. Recall that from the derivation of the LSI (see, e.g. Section 5.1 of [10]), with h = ^fq we have 

1 



d t h = Lh+^h-\Vhf 



and 



2N 2 



V/i(V 2 H)V/ie" w dx. 



In our case, (2.17) and the fact that 



(3.6) 



^( log(|z|+?7)-logH) >0, x>0, 



imply that the Hessian of 7-L is bounded from below as 



\i-j\<Nr, v J > 



{d l h-d 3 hf 



with some positive constant C. This proves (3.2) and (3.3). Inserting the inequality 

dtD^^t) < -CiT UZ D u {y/to) 

from (3.2) into the equation 

and integrating the resulting equation, we prove (3.4). Inserting (3.4) into (3.8) we have 

d t SM < -Crf^SM 

and we obtain (3.5). 



(3.7) 



(3.8) 



□ 



Remark 3.2 The proof of (3.6) requires an integration by parts and the boundary term atxi = Xj (explained 
in Section 5.1. of [10]) should vanish. In the Appendix we will justify this technical step. 

Lemma 3.3 Suppose that the density qo satisfies S UJ (qo) < CN m with some m > fixed. For a fixed n > 1 
let G : M" — > K be a bounded smooth function with compact support and recall the definition of Qi n from 
(2.19). Let J C {1, 2, . . . , N - n} and set t = rf^N 6 . Then we have 



J ^£^,r l (x)dw- / -^^2G hn {x)q duj 

J l£j J l£j 



\l DU f° )T +Ce-^ 



(3.9) 



with some constant C depending on G. 
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Proof. Without loss of generality, we consider only the case J = {1, . . . , N}. Let qt satisfy 

d t q t = Lq t 

with an initial condition qo . Wc first compare q T with qoo = 1. Using the entropy inequality 

\q-l\du<y/S u (q) 
and the exponential decay of the entropy (3.5), we have 

/ ^ 9r<k> - / ^E^ x ) d H <C{N m e~^' 1/3 ) 1/2 <Ce- cN '. 

i i 

To compare qo with q T , by differentiation, we have 

J -^^2Si, n (x)q T d(J - J -^^£ ? ,„(x)q dcj 

i i 

= I ds I — d k G(N(xi - x i+ i), N(x i+n -i - x i+n )) [d i+k -\q s - d i+k q s ]du>. 

*' ^ i fc=l 

From the Schwarz inequality and dq = 2^/qd^/q the last term is bounded by 



fc=i 



-,1/2 



1/2 



1 



iv 2 4^ (a;i+fc-i - £i+fc) 2 



-P^(y / go)r 
N 



(3.10) 



where we have used (3.3) and that 

d k G(N(xi - x i+ i), . . . , N(x i+ k-i -x i+k ),...N(x i+n -i - x i+n j) (x i+fe _i - x i+k ) 2 < CN~ 2 , 

since G is smooth and compactly supported. This proves the Lemma. rj 

Notice if we use only the entropy dissipation and Dirichlct form, the main term on the right hand side of 
(3.9) will become V St. Hence by exploiting the Dirichlet form dissipation coming from the second term on 
the r.h.s. of (3.2), we gain the crucial factor iV -1 / 2 in the estimate. 

The second ingredient to prove Theorem 2.1 is the following entropy and Dirichlet form estimates. The 
first estimate (3.11) should be compared with the bound S^{ft) < CN 5 / 4 for t > TV -1 given in Lemma 5.1 
of [10]. 
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Theorem 3.4 (Entropy and Dirichlet Form Estimates) Suppose the assumptions of Theorem 2.1 hold. 
Let t = r\ 1 l' i N E and let g t = ft/ip so that S fl (ft\ip) = S u (gt). Then the entropy and the Dirichlet form satisfy 
the estimates: 

SU9r/ 2 ) < C?T 5/3 A, (3.11) 
D u {y/g?) < 2r- 1 ^(.g T/2 ) + Crf 2 < Crf 2 A. 



(3.12) 



Proof. First we need the following relative entropy identity from [24]. 



Lemma 3.5 Let ft be a probability density satisfying dtft = Lf t . Then for any probability density %\j t we 
have 

dtS^fMt) = f(9 j V9l) 2 ^td(i+ f g t (L-d t )i> t dfi , 

j J J 

where g t = f t /ip t . 

In our setting, ip is indep of t and L satisfies (2.15). Hence we have 



dtSM = dtS^ft^) 



N 



j \ d jVdt) 2 duj + J Lg t duj + "^2 j b j9 3 gt duj. 



J 3 

Since the middle term on the right hand side vanishes, we have from the Schwarz inequality 



dtSM < -D u (^g7) + CNY, J tfgtdu. 



Together with the LSI (3.4) and (2.18), we have 

dtSM < -Cif^SM + c v - 2 a. 



(3.13) 



(3.14) 



for t < t. Since S u (g ) = 5 M (/ |^) < N m and r/2 > i] 1 / 3 , the last inequality proves (3.11). 

Integrating (3.13) from t = r/2 to t = r and using the monotonicity of the Dirichlet form in time, we 
have proved (3.12) with the choice of r. 

□ 

Proof of Theorem 2.1. Fix r = n 1 l z N e and let qo = g T = f T /tj} with / satisfying the assumption of 
Theorem 3.4. Using (3.12), we have 



N 



< CN- 1/2+E/2 n- 5/6 A 1/2 



and from (3.9) we also have 

/ Jf^iMf^-J ^E^,«( X H ^CN-VWfi-WAW + Ce-*"'. (3.15) 

i i 

Clearly, equation (3.15) also holds for the special choice fo = l (for which f T = 1), i.e. local statistics of fx 
and ui can be compared. Hence we can replace the measure w in (3.15) by fi and this proves Theorem 2.1. rj 
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Denote by 

ctj = aj(t) = E, t Xj (3.16) 
the expected location of Xj w.r.t ft- We now prove Theorem 2.3. 

Proof of Theorem 2.3. With the choice A = C e ?T 2 iV 8 / 9+e in (2.18) and recalling that r) > jV" 1 / 3 ^, 
we have 

^-1/2+^-5/6^/2 <<; L 



Thus we have proved that 



lim 



/ Jf E &Ax)frdn - / ^ E GiA*W\ = 0. (3.17) 



ieJ ieJ 



We now prove (2.26) for k = 2; the other cases are similar. Since we integrate over v, it suffices to consider 
only observables of the form 0(ai — 0:2) with O being bounded and compactly supported. Denote g sc (u) = a 
and we have 

f U+S I 0(a)p%> (v, v + jt^-tt) da = £ £ £ / /(x)0[JVo(xi - ^)]d/i + o(l) (3.18) 

Ju_l5 JR Qsc(V) i:\x,-u\<8 j:\x ] -u\<8 J 

where o(l) — > as JV - >oo,<5— >0 and fd^x is the probability distribution of the eigenvalues of the random 
matrix. 

Proposition 4.4 from Section 4 asserts that max m \ a m (t) — j m \ — > as N — > 00 both for t = and 
t = t. Thus we can replace the conditions |a;fc — u\ < 8 for k = i,j in the summations with the condition 
k G J = [7Vra sc (u — 5), Nn sc (u + S)] at the expense of negligible errors. By using the method of Proposition 
2.1 of [11] and that the derivatives on U are controlled by (2.25) for m large, we can find an initial data f 
such that the difference between the expectation on the right hand side of (3.18) w.r.t. / and f T = e TL f is 
negligible. The distribution / can be found by approximating the reverse heat flow e~ rL f by a sufficiently 
high order Taylor polynomal of e~~ rL as outlined in Remark 1.1 of [11]. 

Thus wc only have to compare 



N 

with the GOE case. We can rewrite the last term as 



! im 7? E / fr{^)0[Na{x l - Xj)}dfj, 

/— >-oo iv . — J 



lim lim ^EE I ?T(*)0[Na{ Xl -x t+n )W. (3.19) 

ieJ n<M J 



The observable in (3.19) is of the form Gi,n(x-) for an appropriate G. From Theorem 2.1, the limit (3.19) is 
unchanged if we replace frd/i by d/j,. Here the assumptions of Theorem 2.1 w.r.t. / can be easily checked 
from the explicit construction in [11]. Note that the entropy assumption in Theorem 2.1 follows from the 
LSI with m = 2. This proves Theorem 2.3. rj 
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4 Proof of Lemma 2.2 



Let P t and Ej denote the probability and expectation with respect to the measure / t /i. We first collect the 
assumptions we need in this section. 

1. Logarithmic Sobolev inequality (LSI) for the distribution v t of the matrix elements. Since LSI holds 
for v (2.21), it also holds for v t as well; see Lemma B.l in Appendix B for a proof. 

2. Upper bound on the number of eigenvalues Mi in an interval of length |/| > (log N) 2 /N: 



¥ t (Mi > KN\I\) < Ce- C ^ KN ^ (4.1) 

for any K > Kq with a sufficiently large fixed Kq. This bound was proved in Theorem 4.6 of [9] 
under the condition of Gaussian decay that follows from LSI (2.21). The conditions y > (log N)/N 
and |/| > (log N)/N were unfortunately erroneosly stated in Theorem 4.6 of [9]; the proof given there 
requires that y > (log N) 2 /N and |/| > (log N) 2 /N . Under a stronger condition on v (see remark after 
Lemma 4.7 of [9]), the statement of Theorem 4.6 of [9] holds for y > (log N)/N and |/| > (log N)/N. 

3. (Theorem El of [10]) Let \E\ < 2. Denote by x a the largest eigenvalue below E and assume that 
a < N—l. Then there exist positive constants C and c such that for any M that satisfy c(log N) 4 /(2 — 
\E\) <M< CN(2 - \E\), we have 

Pt(x Q+1 -E>^, a<N-l) <C e -cP-l*l] 8/ Va? (4 2) 

4. (Equation (4.15) of [10]) Let 

n(E) = ^m{xj < E}. (4.3) 
Then there exists a constant C > such that 

/oo ft 
\n(E) ~ n sc (E)\dE < j^rjj . (4.4) 

The real assumption behind these statements is LSI that was used in the proof of (4.2) and (4.4). Since 
the parameter t will not play any role in this section, we will drop the index from the notation, i.e. we use 
E = E t , P = P t etc. 

Proposition 4.4 below estimates the distance of the eigenvalues from their location given by the semicircle 
law. This will justify that the convex extension of the potential Wj affects only regimes of very small 
probability. Recall the definition (3.16) of ctj = atj(t) for the expected location of the j-th eigenvalue w.r.t 
/t/i. We start with an estimate on the expected location of the extreme eigenvalues: 

Lemma 4.1 Suppose that the probability measure v of the matrix entries satisfies 

e c ^dv(x) < oo (4.5) 



for some c > (this condition is satisfied, in particular, under LSI (2.21)). Then for any 5 > we have 

- 2 - CN- 1/4+s < at < a N < 2 + CN- 1/i+s (4.6) 

with some C depending on 5. 
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Proof. For any M, define the probability measure Cm on R with density 

Z^e- U ^l(\x\ < M). 

Setting M = N~ and using (4.5), the total variational norm between v and Cm is bounded 

HCM-Hloo<Ce- cA,a . 

Denote by Cm = (m* ( uN res P-) the probability law of the random matrices whose matrix elements are 
distributed according to Cm {y resp.). Then the total variational norm between Cm an d v N is bounded by 

HCM-^lloo<CiV 2 e-^. 

Since up to exponential small probability, the largest eigenvalue a; at is bounded, the expectations of xn w.r.t. 
these two measures satisfy 



< CN z e 



2„-cN" 



From Theorem 1.4 of [23], we have 



E c nx n < 2 + CAf 1/2 7V- 1/4 logiV < 2 + N~ 1/i+s 
for any 5 > 0. Thus we have proved that 

a N = E u nx n < 2 + CN- 1 '^ 5 . 
Similar estimate holds for a.\. rj 

Next wc estimate the fluctuations of Xj : 
Proposition 4.2 For any e > we have 

P^maxl^- -aj\> N~ 1/2+e ^ < C e e~ cN ' . (4.7) 

Proof. Using (2.21) and the Bobkov-Gotze concentration inequality, we have for any T > 

maxl^ -Exj\ > 7) < 2iVmaxP(xj — Exj > 7) < 2Ne~~< T WL&TtEe eT ^ x ^ < CWe"^ 



after optimizing for T and using that | Vxj \ < CN 1 ' 2 (see the analogous calculation in the proof of Theorem 
3.1 of [8]). This proves (4.7). n 

The following proposition is a refinement of Proposition 4.2: 
Proposition 4.3 We choose M, K, k positive numbers, depending on N, such that for some small 5 > 

N~ 1/6+2S < k < N~ 5 , Mn 6 >N 25 , K<cNk 3/2 and c(\ogN) 4 /n < M < CNk (4.8) 
with some sufficiently small c > and large C > constants. Let 

$ := 2N- 1 / 2+25 K- 1 ' 2 + (4.9) 
14 



Then for any CNk 3 ' 2 < i < CN(1 - k 3 ' 2 ) we have 

r(\ Xi - ai \>^j <Ce- cN \ 

We also have 

i N 

i=l 

TTie constants C may depend on S but are independent of N. 



(4.10) 



(4.11) 



Proof. Consider an index i with CNk 3 / 2 < i < N(l - Ck 3 / 2 ), then |2 - |-y* 1 1 > Ck. We first show 
that |2 — \xi\\ > Ck 2 with a very high probability. Suppose, in the contrary, that Xi < —2 + Ck 2 for 
some / > CNk 3 / 2 (the case Xi > 2 — Ck 2 is treated analogously). From (4.6) and (4.7) it follows that 
x\ > -2- CN-V i+S with a very high probability. But then the interval [-2 - CW~ 1/4+<5 , -2 + Ck 2 ] 
of length CN~ 1 / 4+s + Ck 2 <C k 3 ^ 2 would contain CNk 3 / 2 eigenvalues, an event with an extremely low 
probability by (4.1). 

Knowing that |2 — \xi\\ > k 2 with a very high probability, we can use (4.2) to conclude that for any index 
i with CNk 3 / 2 <i<N(l- Ck 3 ' 2 ) we have 



Xi > 



M 
~N 



< e 



by (4.8). Then 



— E 

2K+1 



> 



\j-i\<K 



KM 
~N~ 



< Ce 



-cN° 



(4.12) 



(4.13) 



Similarly to the calculation in Theorem 3.1 of [8], by using the logarithmic Sobolev inequality (2.21), we 
have 



i— £ 

\ 2K + 1 ^ 

\ j : .; i<K 



— Y Exj > N~ 1/2+2S K~ 1/2 \ < Ce~ cN6 

2K+1 ,f-^ J ~ / ~ 



(4.14) 



J" : \j-i\<K 



This bound holds for any index i with the remark that if i < K or i > N — K , then the averaging over the 
indices j is done asymmetrically. 

Combining this estimate with (4.13) we have, apart from a set of very small probability, that 



-J— E 

2K + 1 ^ 



■J ■ \i-i\<K 



< $ 



(4.15) 



for any CNk 3 / 2 < i < N(l — Ck 3 / 2 ). Taking expectation, and using that P(max 4 \n\ > K) < e NK for any 
sufficiently large K, we also obtain for these i indices that 



2K 



(4.16) 



Subtracting the last two inequalities yields (4.10). Finally, combining this bound with the estimate (4.7) for 
the extreme indices, we obtain (4.11). rj 
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Proposition 4.4 For any small 5 > and for any j = 1, 2, . . . N we have 

P(|a?j -Til > CA^ 1/5+5 ) < Ce~ cN& (4.17) 

and we also have 

max |a m - 7,„| < CN~ 1/5+s . (4.18) 

The constants C may depend on 6 but are independent of N. 

We remark that in the bulk a m — j m is expected to be bounded by 0(N~ 1+£ ) (in the hermitian case 
it was proven in [15], see also [22]); near the edges one expects a m — y m ~ 0(N~ 2 / 3 ). Our estimate is not 
optimal, but it gives a short proof that is sufficient for our purpose. 

Proof of Proposition 4.4. We define 

n{E) := < E} 

to be the counting function of the expected locations of the eigenvalues. We compare n(E) with n{E) defined 
in (4.3). Using the fluctuation bound (4.7), we have 

n(E - N' 1 ' 2 ^) - Ce- cNC < n(E) < n(E + N' 1 ' 2 ^) + Ce~ cN * 

for any E £ K. In fact, the upper bound on the density (4.1) guarantees that n(E) and n(E) are Lipschitz 
continuous on any scale much bigger than (log N) 2 /N, i.e. 

\n{E) - n(E)\ < CN^' 2+E + Ce~ cNe < CN'^ 2+£ (4.19) 



for any E. 
Wc write 



For the first term 



N 

Y \ a J - 7il = Y ( a o - 7i) + Y (Tj - <*])• 
j=i r-ot}>ij r-aj<n 



Y (°i ~ Ti) = f dE Y <E< aj )= [dE Y ^ i > n -(#) 



=N [ dE l(n sc (E) < n(E))(n sc (E)-n(E)) 



(4.20) 



and the second term is analogous. Wc thus have 

iV 



i=i 

By Lemma 4.1 



i N r 



/ \n(E) - n sc (E)\dE 

J\E\>3 
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For the energy range \E\ < 3, we use (4.19): 



\n(E) - n sc (E)\dE < CN- 1/2+e + / \n(E) - n sc (E)\dE 
\E\<3 J\E\<3 (4.22) 



< CN -l/2+e 

from (4.4). Thus we obtain from (4.21) 



1 N 



TV 
i=i 



with an e dependent constant. 



To estimate \a m — 7 m |, we can assume, without loss of generality, that a m > j m , the other case is treated 
analogously. Let A > be a parameter that will be optimized later. Set too = [CNX 3 ^ 2 ] with a sufficiently 
large constant C. Since n sc {— 2 + 5) ~ <5 3//2 , the parameter A is roughly the energy difference from the edge 
to the TOo-th eigenvalue. 

First we consider an index to such that too < to < N — too. For a small positive number £, define 

S := {j : 7j G [7,„,7 m + £]}. 
From the property n, sc (— 2 + 5) ~ <5 3 / 2 for small <5, we have 

|5| > cJVtt 1/2 . 

Now set £ = min{i|o! m — 7 m |,cA} with some small positive constant c. Since for all j E S 

otj - 1] > ot m - (7 m + £)> ^{a m - 7,„) > i. 

we have 

Ek-7,i>E^ c ^ 2a1/2 - 

Combining this estimate with (4.23), we have 

Assuming that A > CN~ x / b+2e / b , we see that £ = ^\a m — 7 m | and we obtain 

|am-7m| <C^V" 1/5+e (4.24) 

for any to with Too < m < N — niQ . 

For the extreme indices, we use that if to < too, then from Lemma 4.1 and (4.24), we have 

-2 - CW- 1/4+i < ai < a m < a mo < 7mo + C7V- 1/5+£ < -2 + CX + CN-^ 5+£ 

and 

-2 < 7m < -2 + CX 
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for all m < mo. Thus 

|a m -7m| < CX + CN- 1 '^ 

with C depending on e. Similar estimates hold at the upper edge of the spectrum, i.e. for m > TV — mo. 
Choosing A = CN~ 1 / b+e , we conclude the proof of (4.18). The proof of (4.17) then follows from (4.7) and 
this concludes the proof of Proposition 4.4. rj 

The following Proposition is a strengthening of the bound (4.23) used previously. 
Proposition 4.5 With the notations of Proposition 4-3, we have 

i 

Proof. We proceed similarly to the proof of Proposition 4.4 but we notice that in addition to (4.19), a 
stronger bound on \n(E) — n(E)\ is available for E € I := [E- t E+], where E± := ±(2 — Cqk), with some 
large constant Cq. Here k is subject to the constraints in (4.8). To obtain an improved bound, note that for 
any E in this interval 



W =ij2 E1 & ^E E <E + ®) + Ce- cN& = n(E + $) + Ce 



-cN 5 



N w - / - N 

3 3 



(4.26) 



To sec this inequality, define the random index 

jo = Jo(E) := max{j : Xj < E} = ^ lfo < E) 

j 

and the deterministic index 

jx = jx(E) := max{j : aj < E + $} = ^ l(a,- < E + $). 

3 

The estimate (4.26) will then follow if we prove that jo < ji, i-e. ttj < E + <&, with a very high probability. 
By (4.17) we have, with a very high probability, that 

7i0 - CN-^+ & < x jo <E< x j0+ x < 7j0+ i + CN-^ 5+s < 7i0 + CN-^+ s . 

Therefore, with a very high probability, jj is in the CN~ 1 ^ 5+S vicinity of E G /, and thus CTVk 3 / 2 < jo < 
CTV(1 — k 3 / 2 ) holds for any fixed C if Co in the definition of E± is sufficiently large. Thus \xj — ctj \ < $ 
with a very high probability by (4.10), so Xj < E implies ctj < E + $ and this proves (4.26). 
The proof of the lower bound 

n{E) > n{E - $) - Ce~ cN ' 
is analogous. Finally, by the Lipschitz continuity of n(E) on a scale bigger than (log TV) 2 / TV , we have 

\n(E)-n{E)\<C$, VEe[E-,E+], (4.27) 

where we also used that $ > Cexp(— cN s ). 
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Define the interval J = [-2 - CN' 1 ^, 2 + CN 1 / 4 ^}. Using (4.21), we have 



with 



1 N f 

jj E l a i - Til < GO + (^) + ™ + _/ K^) - » sc (£)|di? (4.28) 



a) 



[ \n(E)-n(E)\dE, (II) := [ \n(E) - n(E)\dE , (III) := I \n(E) - n(E)\dE. 
Ji J.j\i J j c 



From (4.27) and (4.19), we have 

(7) < C$, (II) < CN~ 1 / 2+s \J \ I\ < CkN-^ 2+s 

since | J \ I\ < Cn + N' 1 / 4 ^ < Ck. Finally, n(E) = for E < -2 - CA^ 1 / 4 ^ and n(E) = 1 for 
E > 2 + CW~ 1//4+l5 by (4.6). Combining these estimates with the fluctuation (4.7) and with the fact that 
P(max|x;| > K) < Ce~ KN for all large K, hence n(E)(l — n(E)) decays exponentially for large \E\, we 
obtain 

(III) < Cer cN . 

Collecting all these estimates, inserting them into (4.28) and using (4.4), we conclude the proof of Proposition 
4.5. n 

Proof of Lemma 2.2. We assume (2.21), (4.1), (4.2) and (4.4). We choose M,K,k positive numbers 
satisfying (4.8) that will be optimized at the end of the proof. 

Suppose now that Xj G Ij (recall the definition of Ij from (2.10)). Then, from the definition of bj (2.15), 
wc have 



3 N ^ 

k : \k-j\> 

Notice that function g(x) := -|jj!r^~ satisfies 



sgn(xj-x k ) sgn(xj-j k ) 



\Xj-X k \+T) \Xj-J k \+V 



(4.29) 



\g(x)-g(y)\<v-'\x-y\ (4.30) 

as long as x and y have the same sign. In our case, Xj — x^ and Xj — have the same sign as long as 
\xk — 7k | < \xk — Xj\, Applying the upper bound on the local density (4.1) for a grid of intervals of size 
N~~ 1+s . one can easily conclude that for any j, k with \j — k\ > Nr], 

F(\ Xj -x k \< V N- S ) < Ce- cNS/2 
Combining this estimate with (4.17) and assuming 

r) > N~ 1/6 , 

we obtain that Xj — x k and Xj — jk have the same sign apart from a set of very small probability. Applying 
(4.30), we have 

P(N > C V - 2 ^ J2 \^-lk\) < Ce~ cNS/2 (4.31) 

\k-j\>N v 
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By (4.11) and (4.25), after recalling the definition of $ (4.9), we have 

Jf E \**-T>\Zjr E [\a k -J k \ + \x k -a k \]<c( K N-^ + N-^K-^ + ^-) 

\k~j\>N v \k-j\>N v 

with a very high probability. Taking (4.31) into account, we can thus bound bj by 

KM\ 



M < C V - 2 (kN-^ 2+s + N -^+^ k -i/2 + 



with a very high probability. On the complement event we estimate \bj\ by r\ 1 < N. Recalling that P = 
denoted the probability with respect to /t/n, this gives 



with 



A := r,- 2 N 2 (kN- 1 /™ + AT-l/2+2^-1/2 + KM_^ 2 
Optimizing the parameters under the conditions (4.8) we can set 

K = iV-VM+J M = iV l/3 ) R = N l/9 

and finally A < C^ 2 N s / 9+4S . 

Suppose now that Xj ^ Ij. By (4.17) and \bj\ < r)~ l < N we have 

^E / tf 1 ^ t d ^ ^ Ce ~ cNS ( 4 - 32 ) 

This proves Lemma 2.2. rj 

A Some Properties of the Eigenvalue Process 

In the main part of the paper we did not specify the function spaces in which the equations (2.6) and (3.1) 
are solved. In this appendix we summarize some basic properties of these equations. In particular, we justify 
the integration by parts in (3.6). For simplicity, we consider the most singular /3 = 1 case only. 

The Dyson Brownian motion as a stochastic process was rigorously constructed in Section 4.3.1 of [14]. 
It was proved that the eigenvalues do not collide with probability one and thus (2.6) holds in a weak sense 
on the open set Eat. The coefficients of L have a (xi — Xj)" -1 singularity near the coalescence hyperspace 
xi = xj. We focus only on the single collision singularities, i.e. on the case j = i ± 1. By the ordering of the 
eigenvalues, higher order collision points form a zero measure set on the boundary of Sat and can thus be 
neglected. In an open neighborhood near the coalescence hyperspace Xi = xi+i, the generator has the form 

L = ± (d 2 + + - *)) + L reg = ± (d 2 + 91 + h u ) + L reg , u > 0, 
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after a change of variables, v = \(xi + ZEj+i), u = 5(2^+1 — Xi), where L reg has regular coefficients. The 
boundary condition at u = is given by the standard boundary condition of the generator of the Bessel 
process, d 2 + -^d u , which is uf'(u) — > as u — >• 0+. Thus L is defined on functions / S C 2 (Ejv) with 
sufficient decay at infinity and with boundary conditions 

lim (x i+1 -Xi)(d i+ i -di)f -> (A.l) 

for each i. 

The generator L of (3.1) differs from L only in drift terms with bounded coefficients, hence the boundary 
conditions of L and L coincide. Finally, we need some non- vanishing and regularity property of the solution 
of (3.1): 

Lemma A.l Let SI C be a bounded open set such that 

Hn (J {x : Xi = Xj} = 

*<i : (m)tKi,2) 

i.e. f2 intersects at most one of the coalescent hyperplanes, namely the {x\ = X2}. Then any weak solution 
qt(x) of (3.1) with boundary conditions (A.l) is C 2 on (t,x) £ i?+ x and for any t > we have 

< inf q t < sup q t < 00 
n n 

Proof. The statement follows from regularity properties of the Bessel process with generator d 2 + ^d u . 
In a small neighborhood of the coalescence line x\ = X2 one can introduce a local coordinate system (u, y) = 
$(x), where u = 5(2:2 — x\) > 0, y € R* -1 , so that, in the case for GOE, 



~ _ 1 
~ 4A 



9« + 

u 



where L reff is an elliptic operator with second derivatives in the y variables and with bounded coefficients 
on the compact set ^(ri). The solution in the new coordinates is q t (u,y) = q t ($ _1 (u, y)). Introducing a 
function qt(a, b, y) := qt(V a 2 + b 2 , y) defined in N + 1 variables, we see that % satisfies dtqt = Lq t , where 



N 



dl + dl 



i.e. L is elliptic with bounded coefficients in the new variables. Notice that the boundary condition (A.l) 
implies that, in the two dimensional plane of (a, &), the support of the test function for the equation dtqt = Lqt 
is allowed to include the origin (0,0). 

By standard parabolic regularity, we obtain that the solution is C 2 and is bounded from above and below. 

□ 

This lemma justifies the integration by parts in (3.6). Since q £ C 2 and it is separated away from zero, 
h = Jq has no singularity on the coalescence lines. Since the function exp(— T-L) vanishes whenever Xi = Xj 
for some i ^ j, the boundary terms of the form 



f 3 2 ^ e^dx 

J X ; —X , 



in the integration by parts vanish. 
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B Logarithmic Sobolev inequality for convolution measures 

Lemma B.l Suppose K and H are two probability densities on R so that the LSI holds with constants a 
and b, respectively. Then LSI holds for their convolution K * H as 

f(x)logf(x)K*H(x)dx<max(a,b) / {V^f{x)) 2 K * H(x)dx (B.l) 



for any f with J f(x)K * H{x)dx = 1. Here V = d/dx. 

Proof. The following proof is really a special case of the martingale approach used in [17] to prove LSI. 

Let 

9(y) = / f(x)K(x - y)dx. 



Then the left side of (B.l) is equal to 

f(x) log[f(x)/g(y)] + f(x) logg(y) K(x - y)dxH(y)dy. (B.2) 



For any fixed y, from the LSI w.r.t. the measure K(x — y)dx, the first term on the right hand side is bounded 

by 



a / / {^^W)) 2 K{x-y)dxH{y)dy. 

JR JR 

Since J M gH = 1, the second term in (B.2) is estimated by 

f g(y)logg(y)H(y)dy < b f (V ' ^gjy)) 2 H(y)dy = - [ g(y)- 1 ( f f{x)V v K{x - y)dx) H(y)dy. 
Jr Jr 4 Jr \Jr / 

Integrating by parts, we can rewrite the last term as 

7 / f'(x)K(x-y)dx) H(y)dy<b f f {V^J(x)) 2 K(x - y)dxH(y)dy, 



where we have used f'(x) = 2 yfixj V yfjx) and the Schwarz inequality Combining these inequalities, we 
have proved the Lemma. rj 

Acknowledgement: We thank Jun Yin for several helpful comments and pointing out some errors in the 
preliminary versions of this paper. 
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